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Cloning and sequence analysis of a novel transcription-associated zinc ribbon 
protein gene from silkworml] Bombyx mori 

FENG Shan'[] ZHANG Yao-Zhou”-[] 1. Institute of Biochemistry[] Zhejiang University Hangzhou 3100290 
China[] 2. Institute of Biochemistry] Zhejiang Sci-Tec University] Hangzhou 310018[] Chinal] 

Abstract[] The zinc ribbons are one group of diverse zinc finger proteins. Most zinc-ribbon domain is actually 
folded as three-stranded antiparallel B-sheets in their structure instead of finger-like helices. The zinc ribbon is 
a domain that functions as interaction modules binding to nucleic acids. The zinc ribbon proteins perform a 
broad range of functions such as replication and repair] transcription and translation as a transcriptional 
repressor. Through searching cDNA bank of silkworm pupaell a unique EST[] GenBank accession no. 
DY230964[] encoding the zinc ribbon protein was identified. In order to obtain full-length cDNA in Bombyx 
mori[] we used this sequence as a probe and performed in silico cloning based on the B. mori EST database. 
A contig was thus assembled on the basis of several ESTs. It was also verified by RT-PCR. PCR product was 
purified and ligated into pGEM-T-easy vector. The recombinant clones were sequenced[] which had the same 
sequence as the contig. We nominated this novel gene encoding zinc ribbon domain containing protein as 
BmZNRD 1] GenBank accession no. DQ432055[]. The full-length of the gene is 675 bp. It contains an ORF 
of 363 bpl 10 bp nucleotide sequence in 5’ UTH] untranslated region[] and 302 bp nucleotide sequence in 3’ 
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UTRL] untranslated region[]. It encodes 120 amino acids and is highly homologous to those of eukaryotic 


organisms previously reported] about 60%] with their predicted mass 13.54 kD and isoelectric point 6.8. 


The deduced amino acid sequence is one of zinc ribbon proteins with two putative functional domains[] the N- 


terminal domain Cx; Cx, Cx,;C and the C-terminal domain Cx,Cx,,Cx,C. The highly conserved amino acid 


sequence within the C-terminal domain is Cx, Cx, Yx, QxRSADEx, TxFx, Cx, CL] which is well conserved through 


evolution ]in organisms such as yeasts[] Drosophila[] nematodes[] amphibians and mammals[] and highly 


homologous to the yeast RNA polymerase A subunit 9 and transcription-associated proteins. An alignment of 


the cDNA sequence with the silkworm genome sequences revealed that there were 3 exons and 2 introns in the 
gene of BmZNRDIL] and all of them conformed the canonical GT-AG rules. 


Key words[] Bombyx mori[] zinc ribbon protein genel] in silico cloning[] gene cloning[] sequence analysis 


EG BG a Yk 
DODODOD0D DD OOO 000000000 
DODO DO ODO DD OOO DD 0000000 
DO O gd0d 000 O UO expressed sequence tagl 
ESTO BO DO DO DO DDD 000000 
O Adams et al. 19910 UU 000000000 
DODODODMOD0D00 000000000 
ee 0000-00 0000 
DOBDODO DDD D OOO DD 0000000 
O00000 ESTO O O E UO in silico cloning 
DOOD000 000 000000000000 
DOMOQO0D00 000000000 0000000 
0O00EeSTODO Doo o0 00 m0 0000000 
DODO DO 00 MODO DO DO Do ooo 
00000 EsSso0D0M000000000 cDNA 
O Ol Gill and Sanseaul] 200000 0 O 0 O O RT-PCR 
DODODOODO MD 0000 &strouddg 
DODO0 MODO 0000000000000 
gon d000000000000 00 O’Brien 
et al. 1200000 UU O 02002 UU UU UU 
00800000 0000000000000 
O Schaefer{] 1995[] Feng et al . 0 200400 O O  dbEST 
O ESTO 000000 E Prigent et al .0O 199800] 

DOBO DO DO DD 000000000 
OMQO0D00 000000000000 DNAD 
DODODOD DO DO DO DO DO 00000 
O moi 00000000000 00000 Miller 
etal.011985M00000000000000 
Z2"po0000000”00"D00000M00 
0000000000 OO zinc finger protein[] 
ZFPIO Evans and Hollenberg[] 198800 0 000000 
DODODO DO DD OD OD 00000 DNA 
DODd0 M0D0DO0D 0D 0D 0000000 


OO00000 DNAURNA] Liao et al .[] 1992[[] DNA- 
RNA Y O DUO Shi and Bere] 19500000000 
00000 Le et 4.8 1930000000000 
0000000000 
DODODO OD MD 000000000000 
DODODoOo0 00D 0000 Ch, U 
LU DO C,H,-like finger] O O O O O OU treble clef 
finge O OO zine ribbons O O OO Krishna et al .[] 
2030 0000000000 DNADUOUUO 
DODOO0D 00000 0D 00000000 
poog000000 »Dago00000000 
0400001100000 0.0000 
DODBO DO O DD MD ODD000 0000000 
DODO MODO 0D DD 0000 mM0oDo0oo 
DODO0DDO0 MD Do0D0 0000 000000 
000 DNRADUOUUUOUUUUUUUOUO 
pDoOo0000 

HOOUODOO Bombyx moi O cDNA D O O 
DODODODO DDD 00000000000 
ESTO OO 0000 O OBLASTO O GenBank O O 
O ESTOOOUQUOO00 EStTo00000000 
000000000 DNAD OOOO RT-PCR O 
DODODODO DDD ODO D00DO DNAD 
DODODOD DO DO DO DO DO MD O0DO 
0o000000 


1 DO000 


111 OHOUMOO0U00U 

d0xd000 1000003000000 
DODODdO0 DD 000000000 TG 00 
OB 00 0 OU M-MLV cDNA Synthesis kitllTaq DNA O 
0000 0 0 0 0 O 074 DNA O O O Triz O O O 


728 OOO Acta Entomologica Sinica 


49 [] 





RT-PCR O O O O Invitrogen J O [PolyATtract kit[]X- 
gal IPTGOA mpicilline O Promega [] Hood pHelix O 
O O Boehring Mannheim HO 000000 kid 
0000 K*8 0000000000000 00 
ABI Prism 3100-AD O ODO DOO0 00000 
0000 

1.2 DU 

1.2.1 DODOO RNADODO mRNAO DOOD 
HOU -soci 000000 00075%0 0 0 0 
DOODDDO DDD 000000000000 
000 Trld00000000 RNAD O O 
PolyATtract kit O MmRNADO O O O O Promega O O 
DoOo0000000 

1.2.2 cCDNAD 000000 mRNA ga O do O 
M-MLV cDNA Synthesis kit O cDNAQ UO 100 2 
OUWUOUUUO eDNAU UU T4 dDNADUUUO 
00000000 And 0D 000 pHelix OO 00 
0 cach OO UOUUUUUU O Tet OU Sambrook 
and Fritsch] 198911] O O U O IPTGLX-gal 0 O O O 
O AO 100 yg migo 1800 0187000000 


DODdDOgDD DO D 0000000 LB] Amp* O 
0000000000000000 37000 
00000 DNADO £coR1O Hindg O O DO 
0000 ABI Prism 3100-AD 00000000 
1.2.3 ODO0O0000000000000000 
0000 DNADHOUOOUOOUWODO0OO 
D000000000 ESTO OD GenBank O O O 
DY230964 0 DOO ceso OOUUOUOGUdCOOO 
000 DY230964 0000000 BLasTOo O00 
OO GenBakO OOO ESTOOoOuUguddodg 
OOOOOUO00U0DU0COCOO Esto 000 O 
Dodo 00000 0 000 00 contigADO O O O 
00000000000 BastrpgomnoDoo0o 
000000000000 contig 0 O cDNA 
O Keni] 2002700 0 0 00 contig 0000000 0 
OOOHOWUOWUOOUUOODUdUUOO 
OMOU0O0UO00UMOO0U0O0OU0UdUdO 
DOBOa DO ong gn DO DO 000000000 
00000000 10 





01 UUUUUUOUUUUUU 


Table 1 The main tools of analysis in bioinformatics 
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Fig. 1 The result of RT-PCR for zinc ribbon 000000 AaTrGODo00D0 10000000 
protein gene of Bombyx mori TADOD00 300 30000000000 
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Fig. 2 Nucleotide sequence and deduced amino acid sequence of the silkworm BmZNRD1 cDNA 
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O 0 0 O OD The coding sequence is shown in uppercase letters[] while the noncoding sequence is depicted in lowercase letters[] the stop coded] taal] is 


indicated by asterisk] * [[] the deduced amino acid sequence is shown in single-letter code below the second base of each codon[] the exons are divided by 


vertical lines and arrows at the boundary. 
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Fig. 3 Schematic representation of the silkworm BmZNRD1 gene 
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Table 2 The junction sequences and sizes of BmZNRD1 gene exons and introns in the silkworm[] Bombyx mori 






































O00 Intron OO00 0 Inton size] bp 5‘0 00 5’ splicing site 3000 3’ splicing site 
1 224 TGATA | et} aag tac |ag| ATTAT 
2 258 TGCAA [et atg tte [ag] ATACA 




















O Ntesdg aga dnd add ao ooo oo gado 0000000 000000000 OO Uppercase and lowercase letters indicate exon 


and intron sequences[] respectively[] the conserved splice donor and dinucleotide sequences are indicated in box. 
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Table 3 Probable promoter sequences of BmZNRD1 gene predicted by NNPP web server 
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Fig. 4 Alignment of the deduced amino acid sequence of BmZNRD1 with homologous proteins from other organisms 


RPA12-HUMAN[] [O O Homo sapiens[] GenBank accession no. Q9P1UO0[] RPA12-MACMU[T] O O Macaca mulattal] QSTMSO[] RPA12-RATIT] O Rattus 


norvegicus[] QOMFYS5[] RPA12-MOUSENO O Mus musculus Q791N71] BmZNRD100 O Bombyx moril] DQ432055. 


*900000 0 The identical 


amino acid JO 0O 00000000000 40000000 O0 Cys Black area indicates the four key functional amino acid§] Cys[ in the zinc finger 


domain. 
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